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Let (W, W') be an exchangeable pair. Assume that 

E(W - W'\W) = g(W) + r(W), 

where g(W) is a dominated term and r(W) is negligible. Let G(t) = 
J * g{s)ds and define p(t) = Cie _c ° G '*' ) , where Co is a properly cho- 
sen constant and ci = 1/ e~ c ° G ^ dt. Let Y be a random variable 
with the probability density function p. It is proved that W converges 
to Y in distribution when the conditional second moment of (W — W') 
given W satisfies a law of large numbers. A Berry-Esseen type bound 
is also given. We use this technique to obtain a Berry-Esseen error 
bound of order l/\/n in the non-central limit theorem for the magne- 
tization in the Curie- Weiss ferromagnet at the critical temperature. 
Exponential approximation with application to the spectrum of the 
Bernoulli-Laplace Markov chain is also discussed. 
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2 S. CHATTERJEE AND Q.M. SHAO 

1. Introduction and main results. Let W be the random variable 
of interest. Typical examples of W include the partial sum of independent 
random variables and functionals of independent random variables or de- 
pendent random variables whose joint distribution is known. Since the exact 
distribution of W is not available for most cases, it is natural to seek the 
asymptotic distribution of W with a Berry-Esseen type error. Let (W, W) 
be an exchangeable pair. Assume that 

(1.1) E(W -W'\W) = g(W) + r(W), 

where g(W) is a dominated term while r(W) is a negligible term. When 
g{W) = XW, and E((W' — W) 2 \W) is concentrated around a constant, 
Stein's method for normal approximation shows that the limiting distri- 
bution of W is normal under certain regularity conditions. We refer to 
Stein (1986), Rinott and Rotar (1997), Chen and Shao (2005) and refer- 
ences therein for the general theory of Stein's method. The main aim of this 
paper is to find the limiting distribution of W as well as the rate of conver- 
gence for general g. The key step is to identify the limiting density function. 
As soon as the limiting density function is determined, we can follow the 
idea of the Stein's method of exchangeable pairs for normal approximation. 
Let 

(1.2) G{t)= [ g(s)ds and p{t) = Cl e- C ° G ^ , 

Jo 

where cq > is a constant that will be specified later and c\ = 1/ e~ c ° G ®dt 
is the normalizing constant. Let Fbea random variable with the probability 
density function p. Set 

(HI) g(t) is non-decreasing, and g(t) > for t > and g(t) < for t < 0; 
(H2) there exists C2 < oo such that for all x, 

min (l/ci, l/|coff(x)|)(|z| + 3/ci) max(l, co\g'(x)\) < c 2 . 
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NON-NORMAL APPROXIMATION 
(H3) there exists C3 < 00 such that for all x, 

[1/d, l/\c g(x)\) (\x\ + 3/ Cl )c \g'(x)\ < c 3 . 



mm 



Let A = W — W . Our main result shows that W converges to Y in 
distribution as long as cqE(A 2 \W) satisfies a law of large numbers. 

Theorem 1.1. Let h be absolutely continuous with \\h'\\ = sup x \h'(x)\ < 



00. 



(i) If (HI) and (H2) are satisfied, then 
(1.3) 

\Eh(W) - Eh(Y)\ 



< ||^||| (l±^)^|i _ ( C0 /2)E(A 2 \W)\ + lco(l + c 2 )E\A\ 3 + c c 2 E\r(W)\} 

(ii) If (HI) and (H3) are satisfied, then 
\Eh(W) - Eh(Y)\ 
(1.4) ^ II^I^^T^I 1 - (c /2)E(A 2 \W)\ + ico(l + c 3 )E\A\ 3 
+ ^E((\W\ + l)\ r (W)\)} 



When A is bounded, next theorem gives a Berry-Esseen type inequality. 

Theorem 1.2. Assume that \W — W'\ < 5, where 5 is a constant. If 
(HI) and (H3) are satisfied, then 

\P(W < z)-P(Y < z)\ 
< 3E\l - (c /2)E(A 2 \W)\ + Cl max(l,c 3 )5 + 2c E\r(W)\/ Cl 
(1.5) +5 3 c {(2 + c 3 /2)£| C o 9 (WO| + C1C3/2}. 

We remark that cq can be chosen as follows. In order to make the er- 
ror term on the right hand side of (jl.3p small, it is necessary that E\\ — 
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4 S. CHATTERJEE AND Q.M. SHAO 

{c Q /2)E{A 2 \W)\ and therefore E(l - (c /2)E(A 2 \W)) must be small 
and we should choose Co so that Co ~ 2/E(A 2 ). 

The paper is organized as follows. In section 2, we give a concrete applica- 
tion of our general result to the magnetization of the Curie- Weiss model of 
ferromagnets at the critical temperature, and show that the rate of conver- 
gence achieves 0(n~ l l 2 ). In section 3 we focus on approximation by the ex- 
ponential distribution with an application to the spectrum of the Bernoulli- 
Laplace Markov chain. We present a general approach of Stein's method of 
exchangeable pairs in section 4 and postpone detailed proofs of our main 
results to section 5. 

2. Curie- Weiss Model. Consider the Curie- Weiss model for n spins 
at temperature T, i.e. the probability distribution on {— 1, l} n that puts 
mass 

exv \r^r n — J 

at a G {— 1, l} n , where is the normalizing constant. Let us fix T = 1, 
which is the 'critical temperature' for this model. Now let 

n 

W = W(a) = n- 3 / 4 Y, a i- 
i=i 

This is a simple statistical mechanical model of ferromagnetic interaction, 
sometimes called the Ising model on the complete graph. For a detailed 
mathematical treatment of this model, we refer to the book by Ellis (1985). 

Following ideas in Simon and Griffiths (1973), it was proved by Ellis and 
Newman (1978a, 1978b) that as n —¥ oo, the law of W converges to the 
distribution with density proportional to e -1 ' 4 / 12 . For various interesting 
extensions and refinements of their results, let us refer to Ellis, Newman, 
and Rosen (1980), and Papangelou (1989). 
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NON-NORMAL APPROXIMATION 5 

Below, we present a Berry-Esseen bound for this non-central limit theorem 
obtained via Theorem II .21 Incidentally, Theorem 1 1 , 2 1 can also be used to ob- 
tain similar error bounds for the other limit theorems in the aforementioned 
papers (in particular, the Curie- Weiss model at non-critical temperatures), 
but we prefer to stick to this example only, since it is probably the most 
interesting and relevant one. 

Given a random element a, construct a' by choosing a coordinate / at 
random and replacing 07 by a'j, where a'j is generated from the conditional 
distribution of 07 given (aj)j^j. In other words, we take one step of the 
Glauber dynamics. It is easy to see that (a, a') is an exchangeable pair. Let 
W' = W(a'). We shall show that (see Section 5) 

(2.1) E\E(W - W'\W) - ^n~ 3/2 W 3 \ = 0(n~ 2 ), 

(2.2) E\E{(W - W) 2 \W) - 2n~ 3 l 2 \ = 0{n~ 2 ), 

(2.3) \W - W\ = 0(n~ 3 / 4 ), 
and 

(2.4) E\W\ 3 = O(l). 

Let us now explain roughly how we arrive at (|2.ip . which is the most im- 
portant step. A simple computation shows that at any temperature, 

E(W - W'\W) = n- 3/4 (m - tanh(m/T)) + 0(n~ 2 ), 

where m := n _1 / 4 W is the magnetization. Since m ~ with high probability 
when T > 1, and tanhx = x — x 3 /3 + 0(x 5 ) for x ~ 0, we see that the right 
hand side in the above equation is like n _3//4 m(l — 1/T) when T > 1, while 
it's like n _3//4 m 3 /3 when T = 1. This is what distinguishes between the high 
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6 S. CHATTERJEE AND Q.M. SHAO 

temperature regime T > 1 and the critical temperature T = 1, and this is 
how we arrive at equation (|2,ip . 
Let 

g (w) = \n^' 2 w\ c = n 3 / 2 , 5 = 0(n- 3 / 4 ) 

Then 

Gi(iy) = c / g(t)ttt = ^ 4 /12. 

With the above information, it can be easily checked that by Theorem 11.21 
we get 

Theorem 2.1. Let Y be a random variable with density function 
pH= Cl e- 4 /l 2 , where Cl = ___L__ = ^^y- 

T/ien /or 

(2.5) |P(W < z) -P(y < z)| < c n~ 1/2 , 

where c is an absolute constant. 

Incidentally, after this manuscript was submitted, it was brought to our 
attention that an article by Eichelsbacher and Lowe (2009) was in prepara- 
tion, where the same result (Theorem 12. ip is proved, along the same lines 
as our proof. Eichelsbacher and Lowe (2009) has generalizations of Theorem 
12. II to some other mean-field models. 

3. Exponential limit with application to spectrum of the Bernoulli- 
Laplace Markov chain. In this section we focus on the exponential limit. 
Let (W, W) be an exchangeable pair satisfying 

(3.1) E(W-W'\W) = l/c + r(W), 

where Co > is a constant. Let A = W — W . As a special case of Theorems 
11.11 and 11.21 with a constant function g, we have 
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Theorem 3.1. Let Y have the exponential distribution with mean 1. 
Assume (j3.jp is satisfied. 

(i) Let h be absolutely continuous with \\h'\\ < oo. Then 



(3.2) \Eh{W) - Eh{Y)\ 

< \\h'\\{E\l - (cq/2)E{A 2 \W)\ + cqE\A\ 3 + 3c E\Wr(W)\}. 



(ii) If |A| < 5 for some constant 5, then 

(3.3) \P(W < z)-P(Y < z)\ 

< 3E\1 - {c /2)E(A 2 \W)\ + 5 + 2c 5 3 + 3c E\Wr{W)\. 

We refer to Chatterjee, Fulman and Rollin (2008) and Pekoz and Rollin 
(2009) for other general results for the exponential approximation. 

We now apply Theorem 13.11 to the spectrum of the Bernoulli-Laplace 
Markov chain, a simple model of diffusion, following the work of Chatterjee, 
Fulman and Rollin (2008). Two urns contain n balls each. Initially the balls 
in each urn are all of a single color, with urn 1 containing all white balls, 
and urn 2 all black. At each stage, a ball is picked at random from each 
urn and the two are switched. Let the state of the chain be the number 
of white balls in the urn 1. Diaconis and Shahshahani (1987) proved that 
(n/4) log(2n) + cn steps suffice for this process to reach equilibrium, in the 
sense that the total variation distance to the stationary distribution is at 
most ae~ dc for positive universal constants a and d. In order to prove this, 
they used the fact that the spectrum of the Markov chain consists of the 
numbers 

(3.4) Xi = l- i(2n-i + l)/n 2 for i = 0,l,---,n 

occurring with multiplicities 
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Let I have distribution P{I = i) = m, where 

_ g) - ta 

' " © 

for < i < n. Then Xj is a random eigenvalue chosen from {Aj,0 < 2 < n} 
in proportion to their multiplicities. Hora (1998) proved that W = n\j + 1 
converges in distribution to an exponential random variable with mean 1. 

Noting that nAj + 1 = (n — i)(n + 1 — i)/n := ^j, we can rewrite W = fij. 
To apply Theorem 13.11 we construct an exchangeable pair (W, W') using a 
reversible Markov chain on {0, 1, • ■ ■ ,n} with transition probability matrix 
K satisfying 

n{i)K(i,j) = n{j)K(j,i) for all i,j <E {0, 1, • • • ,n}. 

Given such a K, we obtain the pair (W, W) by letting W = uj where 
/ is chosen from the equilibrium distribution tt, and W = [ij where J 
is determined by taking one step from state / according to the transition 
probability K. As proved in [I], we have (with A = W — W). 

E(W) = 1, E(A 2 \W) = and ^|A| 3 < 6n~ 5/2 . 
Now applying Theorem 13. 1\ we have 

Theorem 3.2. Let Y have the exponential distribution with mean 1 and 
h be absolutely continuous with \\h'\\ < oo. Then 

(3.5) \Eh(W) - Eh{Y)\ < Un" 1 ' 2 

As the difference between W and W' is large when / is small, Theorem 13. II 
doesn't provide a useful Berry-Esseen type bound. However, using a com- 
pletely different approach and some heavy machinery, Chatterjee, Fulman 
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NON-NORMAL APPROXIMATION 9 
and Rollin (2008) are able to show that 

sup \P(W <z)- P{Y <z)\< Cn~ 1/2 , 

z 

where C is a universal constant. 

4. The Stein method via density approach. Let p be a strictly 
positive, absolutely continuous probability density function, supported on 
(a, b), where — oo < a < b < oo. Assume that a right limit p(a+) at a and a 
left limit pip—) exist. Let p' be a version of the derivative of p and assume 
that 

b 

\p'[t)\dt < oo. 

Let Y be a random variable with the probability density function p. In this 
section we develop the Stein method via density approach. The approach was 
developed in Stein, Diaconis, Holmes and Reinert (2004), but the properties 
presented in section 4.2 are new. 

4.1. The Stein identity and equation. A key step is to have Stein's iden- 
tity and Stein's equation. Let V be the set of bounded, absolutely continuous 
functions / with fib—) = f(a+) = 0. Observe that for any / G T> 

E{fiY) + fiY)p'iY)/piY)} 
= E{(f(Y)p(Y))'/p(Y)} 

(4.1) = / (f(y)p(y))'dy = 0. 

J a 

The Stein identity is 

(4.2) Ef'(Y) + Ef(Y)p'(Y)/p(Y) = for / G V. 

For any measurable function h with E\h(Y)\ < oo, let / = fh be the solution 
to Stein's equation 

(4.3) f'(w) + f(w)p'(w)/p(w) = h(w) - EhiY). 
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10 S. CHATTERJEE AND Q.M. SHAO 

It follows from ([43]) that 

(f(w)p(w))' = (h(w) - Eh{Y))p{w) 

and hence 

rw 

f(w) = l/p(w) (h(t) - Eh{Y))p(t)dt 

J a 

(4.4) = -l/p(w) [ (h(t) - Eh(Y))p(t)dt. 



Note that f h G V. 

Consider two classes of density functions. The first one is the family of 
exponential distributions. It is easy to see that if Y has the exponential 
distribution with parameter A, that is, Y is a random variable with density 
function p(x) = Xe~ Xx for x > and p{x) = for x < 0. Then p'(x)/p(x) = 
—A and the Stein identity (|4.2p becomes 



(4.5) Ef'(Y) - XEfiY) = for / G V. 

The second is the family 

ae -\*\ a /P 

p(x) = ——. — — , — OO < X < oo 

FV ' 2/3 1 /«r(l/a) 

where a > 0,/3 > 0. Then p'{x)/p{x) = — ^|x| Q_1 sign(x) and hence the 

Stein identity reduces to 

Ef'(Y)-^E\Y\ a - 1 sign(Y)f(Y) = for / G V. 

4.2. Properties of the Stein solution. In order to determine error bounds 
for the approximation to E(h(Y)), we need to understand some basic prop- 
erties of the Stein solution fh. In the following, we use the notation \\g\\ := 
sup xeR \g{x)\. 

Lemma 4.1. Let h be a measurable function and fh be the Stein solution 
and let F{x) = ff p{t)dt. 
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NON-NORMAL APPROXIMATION 11 
(i) Assume that h is bounded and that there exist d\ > and d2 > 



(4.6) min(l - F(x),F(x)) < d x p{x) 
and 

(4.7) \p'(x)\mm{F{x), 1 - F(x)) < d 2 p 2 (x). 
Then 

(4-8) ||/h||<2di||fc||, 

(4.9) \\f hP '/p\\<2d 2 \\h\\ 
and 

(4.10) ||/£||< (2 + 2d 2 )|H|. 



(ii) Assume that h is absolutely continuous with bounded h! . In addition 
to Q^.<?[ ), d-^. 7[ ), assume that there exist d% and d^ such that 
(4.11) 

min (E\Y\I {Y < X} + E\Y\F(x), E\Y\I {Y>x} + E\Y\(1 - F(x)j)\{p' /p)'\ 
< d 3 p{x) 

and 
(4.12) 

min (E\Y\I {Y < X} + E\Y\F(x),E\Y\I {Y>x} + E\Y\(1 - F(x)j) 
< d^p{x). 

Then if h is absolutely continuous with bounded derivative h' , 



(4-13) ||/£||<(l + d 2 )(l + d 3 )||fc'||, 

(4-14) \\f h \\<d A \\h!l 
and 

(4.15) \\f h \\ <(1 + 4HIK||. 
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12 S. CHATTERJEE AND Q.M. SHAO 

Proof, (i) Let Y* be an independent copy of Y. Then we can rewrite fh 
in |OD as 

f(w) = (l/p(w))E(h(Y) - h(Y*))I {Y<w} 

(4.16) 

= -(l/p(w))E(h(Y) - h(Y*))I {Y>w} , 

which yields 

(4.17) |/H| < 2\\h\\ min(F(w), 1 - F{w))/p{w). 

Inequality (|4,8p now follows from (14.6P and (|4.17p . Inequalities (14.17P and 
fllTD imply |ApVp| < 2d 2 ||/i||, that is flM]), and now (|4TT0|) follows from 
(|Q|> . 

(ii) Let gi(x) =p'(x)/p(x). Recall by (ji~3l) 

(4.18) f" = h'-f'gi-f^. 
To prove (|4.13p . it suffices to show that 

(4-19) IIMII< W|| 

and 

(4.20) \\f'gi\\<(l + d 3 )d 2 \\h'\\. 
By ()4.16p again, we have 

(4.21) \f(w)p(w)\ 

< \\h'\\ min (e(\y\ + |y*|)/ {y < w} ,£(|Y| + \Y*\)I {Y>W} ) 

= \\h'\\ min ^|y[/ {1 r< w} + E\Y\F(w),E\Y\I {Y>w} + £|F|(1 - F(w)). 

This proves f|4. 19j) by assumption (|4.7p . This also proves (|4.14p by (|4.12p . 
It follows from P~T8|) that 

(tf - fg[)p = pU" + f'gi) = f'p + /V = ifp)'- 
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NON-NORMAL APPROXIMATION 13 

Thus 

rw rb 

f'(w)p(w) = (ti- fg[)pdx = - (ti- fg'Jpdx 

J a J w 

and hence 

\f'(w)p(w)\ < \\h'\\(l + d 3 )mm(F(w),l - F(w)), 

which gives (|4.20p as well as (14.15H by (|4.12p and (j4~6j) respectively. □ 
The next lemma shows that (|4.6p - (|4.12j) are satisfied for p defined in 

OO- 
LEMMA 4.2. Let p be defined as in (j 1.2$ . Assume that (HI) and (H2) 
are satisfied. Then Q^.gp - < \4-lty hold with d± = 1/ci, d? = 1, = oi and 
d 4 = c 2 . 

Proof. Let g 2 {t) = c g(t), Gi(t) = c G(t) and F(t) = P(Y < t) be 
the distribution function of Y. We first show that (|4.6|) is satisfied with 
d\ = 1/ci . It suffices to show that 

(4.22) F(t) < F(0)p(t)/cx for t < 
and 

(4.23) 1 - F(t) < ((1 - F(0))/ci)p(t) for t > 0. 

Let = F(t) - (F(0)/ci)p(t) for t < 0. Noting that 

H'(t) = p(t)-(F(0)/ Cl )p'(t) 

= p(t) + {F(0)/ Cl )g 2 (t)p(t) 
= p(t)(l + g 2 (t)F(0)/ Cl ), 

Since g 2 (t) is non-decreasing, if H'(0) > 0, then there is at most one to such 
that H'(t ) = 0; if H'(0) < 0, then H'(t) < for t < 0. Hence H achieves 
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14 S. CHATTERJEE AND Q.M. SHAO 

maximum either at t = or t = — oo. Notice that H(0) = H(—oo) 
H(t) < for all t < 0. This proves K22\\ . Similarly, (H23| holds. 
Next we prove (|4.7p . Noting that p' = —pg 2 , we have for t < 

(4.24) F(t) = f p{s)ds 

J — oo 

< (' MfMf) ds 

-oo 52W 

* -P'(s) 



, 52 (t) 

p(t) 



P(t)/lfl*(t)l- 



"52 (*) 

Similarly, we have 

(4.25) 1 - F(t) < p(t)/g 2 (t) for t > 0. 

Hence (|4.7p is satisfied with d 2 = 1. 
Note that flUD and (@7?D imply that 

(4.26) 1-F(x) <p(x)vam(l/cL,l/\g2(x)\\ for x > 
and 

(4.27) F(x) <p(x)min fl/ci, l/|g 2 (aOl) for x<0 
To verify (14. lip , with x > write 

(4.28) ^|y|/ {y>a;} 

/>oo 

= xP(y>x)+ / P(Y>t)dt 

J X 

< xp(x)imn{l/c 1 ,l/\g 2 {x)^+ J p(t) min (l/ci, l/| ff2 (*)|)^ 

< xp(x)mm (l/ci,l/|p 2 0*0l) +min^l/ci,l/|g 2 (a;)|) ^ p(t)di 

< min (l/ci, l/| 52 (x)|){xp(x) + (1 - F(x))} 

< min (l/c 1 ,l/\g 2 (x)\){xp(x) +p(x)/c 1 } 

< p(x)mm (l/ci,l/\g 2 (x)\){x+ l/ci}. 
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NON-NORMAL APPROXIMATION 15 

Similarly, for x < 

(4.29) E\Y\I {Y<X} <p(x)mm(l/c 1 ,l/\g 2 (x)\ S ){\x\ + l/c 1 }. 

(f¥728|) and P~29|) with x = also give E\Y\ < 2/ci. Hence, recalling (j4T26|) 
E\Y\I {Y>x} + E\Y\(l-F(x)) 
< p(x) min (l/ci, l/\g2(x)\ ) {x + 3/ci} for x > 



(4.30) 



£|Y|/ {y<:c} + £|y|F(x) 

< p(x) min ( l/ci, 1 / 1^2 (^c) I ) {M + 3/ci} for x < 0. 



and 



(4.31) 



Thus, ([4TTT|) holds with d 3 = c 2 by (H2). 

Equations (|4.30[) and (|4.3ip also show that (|4.12p is satisfied with d± = c 2 . 
This completes the proof of Lemma 14.21 □ . 

From the proof of Lemma 14.21 one can see the following remark is true. 

Remark 4.1. Assume that (HI) and (H3) are satisfied. Then Q^.gp - 
l \4- 11 ) hold with d\ = l/ci, d 2 = 1 and d% = C3 and hence Q^.lffp and (J^TT5j) . 



5. Proof of main results. In this section we prove the general error 
bounds (Theorems 11.11 and Theorem 1 1 . 2|> . the result for the Curie- Weiss 
model (Theorem 12. ip . and Theorem 13.11 



5.1. Proof of Theorem 11.11 Let / = fh be the solution to Stein's 
equation (|4.3p . Then 



(5.1) Eh(W)-Eh{Y) = Ef'(W) + Ef(W)p'(W)/p(W) 

= Ef'(W)-c Ef(W)g(W). 
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16 S. CHATTERJEE AND Q.M. SHAO 

Recall A = W — W and observe that for any absolutely continuous func- 
tion / 

= E(W-W')(f(W') + f(W)) 

= 2Ef(W){W - W') + E{W - W')(f{W') - f(W)) 

,o 

= 2E{f(W)E((W -W')\W)} - E(W -W) / f'(W + t)dt 

/oo 
f'(W + t)K(t)dt 
-oo 

where 

K(t) = E{A(I{-A < t < 0} - J{0 < t < -A})|W}. 
Substituting ()5.2[) into f)5. 1[) gives 

Ef{W) - c Ef(W)g(W) 
= Ef'(W) - (c /2){E [°° f'(W + t)K(t)dt - 2Ef(W)r(W)\ 

^ J — oo ' 

= E{f(W)(l-(c /2)E(A 2 \W))} 

/oo 
(f\w)-f\w + t))k(t)dt 
-oo 

(5.3) +c Ef(W)r(W). 

When (HI) and H(2) are satisfied, by Lemmas 14.11 and 14.21 

(5.4) || A|| < C2 \\h% \\f' h \\<(l + c 2 )\\h'\\/c u ||/£||<2(l + c 2 )|K|| 
and hence 

\Ef' h (W)-c Ef h (W)g(W)\ 

< {l + C2)m E\(l-(c /2)E(A?\W))\ 
ci 

+(1 + c 2 )||/i / ||co£?|A| 3 /2 + coc 2 ||/i'||^|r(W)|. 



This proves (|1.3|) . 
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NON-NORMAL APPROXIMATION 17 
Under (HI) and (H3), by Remark O 

(5.5) Wl^ll + Q.JIfc'IM, ||$||< 2(1 + c&)||/i'|| 
From (jHSD, ([ODD and dH5H) it follows that 

|/H| < (l/p(w))\\ti\\mm(E\Y-Y*\I {Y < w} ,E\Y-Y*\I {Y > w} ) 

(5.6) < ||/i'|| min(l/ Cl , l/\g 2 (w)\)(\w\ + 3/ci) < ||fc'||(M + 3/ Cl )/ Cl 

This proves (H3D by ([O]) . (pT5l) and ([521). □ 

5.2. Proof of Theorem 11.21 Since f)1.5|) is trivial when c\c^5 > 1, we 
assume 

(5.7) dc 3 5 < 1. 

Let F be hte distribution function of Y and let / = f z be the solution to 
the equation 

(5.8) f'(w) - c f(w)g(w) = I(w < z) - F(z). 
By §M 

2Ef(W)g(W) + 2Ef(W)r(W) 

/oo 
f'(W + t)K(t)dt 
-oo 

= E J {cof(W + t)g(W + t) + I(W + t<z)-F(z)}K(t)dt 

> EJ c f(W + t)g(W + t)K(t)dt + EI(W < z- 5)A 2 - F(z)EA 2 
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and hence 

EI(W <z- 5)A 2 - F(z)EA 2 

f<5 



< 2Ef{W)g{W) + 2Ef{W)r{W) + -c E [ f(W + t)g{W + t)K(t)dt 
= 2Ef(W)g(W)(l - (c /2)E(A 2 \W)) + 2Ef(W)r(W) 
+c E J {f(W)g(W) - f(W + t)g(W + t)}K(t)dt 
(5:9} J1 + J2 + J3. 

From Lemmas 14,11 and 14.21 again, we obtain 

(5.10) ||/*||<2/ci, H/^ll <2/c and \\f z \\ < 4. 
Therefore 

(5.11) \Ji\ < (A/co)E\l - (c /2)E(A 2 \W)\. 
and 

(5.12) \J 2 \ < (4/ Cl )E\r(W)\ 
To bound J3, we first show that 

(5.13) sup \g(w + 1) - g(w)\ < — — (ci + c \g(w)\). 
\t\<5 Zc 

From (H2) it follows that 

(5.14) \g'(x)\ < 



3c min(l/ci, l/\cog(x)\) 
cic 3 



3c 



max(ci, \c g(x)\) 



< %^( c i + \cog(%)\)- 

3c 
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NON-NORMAL APPROXIMATION 19 
Thus by the mean value theorem 

sup \g(w + t) - g(w)\ 
\t\<5 

< 5 sup \g'(w + t)\ 

\t\<5 

c\c 3 5 . . . . 

< — — (ci + c sup \g(w + t) | 

■3 C \t\<5 

< — — (ci + c \g(w)\ + c sup \g(w + t) - g{w)\) 

6c \t\<5 
= ^r^(ci + c \g(w)\) + sup \g(w + 1) - g(w)\) 

c\c 3 5 1 

< -5 — (ci + cobHI) + o SU P \9( w + *) - 

-JCo o |t|<5 

by ([52!). This proves (pTl3j) . 

Now by (I53U|) and (jSTTgl) . when |i| < 5 

|/MsM -/(«J + tM«; + *)| 



< + t) - /H| + + t)\\g(w + 1) - g(w)\ 

< 4|^H||t| + -^(c 1 + Co bH|) 

ci 2co 

< (4 + c 3 )5\g(w)\ +5cic 3 /c . 



Therefore 



|J 3 | < c (4 + c 3 )5E\g(W)\A 2 + 5c lC3 EA 2 

(5.15) < (4 + c 3 )5 3 E\c g(W)\ + c lC3 5 3 . 

Combining (|OJ) . (pU2l) . ([5TTTD and (l5TT5]l shows that 

J3I(W < z - <5)A 2 - F(z)EA 2 
< (4/co)E\l - (c /2)E(A 2 \W)\ + (4/ Cl )E\r(W)\ 

(5.16) +(4 + c 3 )5 3 E\c g(W)\ + cic 3 5 3 . 
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On the other hand, using F'(z) = p(z) < ci, we have 

EI(W <z- 5)A 2 - F{z)EA 2 

= —(EI(W<z-S)-F(z-S)) 
co 

-— E{I(W <z-S)- F(z))(l - -E(A 2 \W))\ 
co 2 

+-(F(z - 5) - F(z)) 

CO 

> — (P(W < z-S) -F(z-S)) 
co 

(5.17) -^E\1- C ^E(A 2 \W)\- 2 -^, 

co 2 c 

which together with (|5.16p yields 

(5.18) P(W < z-S) -F(z-5) 

< E\l - (c /2)E(A 2 \W)\ + c\5 

+|((4/c )£|l - (c /2)E(A 2 \W)\ + (4/ Cl )E\r(W)\ 

+(A + c 3 )5 3 E\c g(W)\+c 1 c 3 5 3 ) 
= 3E\1 - (c /2)E(A 2 \W)\ + ci<5 + 2coE\r(W)\/ci 

(5.19) +<5 3 c {(2 + c 3 /2)E\cog(W)\ + Cl c 3 /2). 

Similarly, we have 

(5.20) F(z + 5)- P{W <z + 5) 

< 3E\1- (c /2)E(A 2 \W)\ + c x 8 + 2c E\r(W)\/ Cl 

(5.21) +5 3 cq{ (2 + c 3 /2)E\c g(W) | + Cl c 3 /2] . 

This completes the proof of (|1.5p . □ 

5.3. Proof of TheoremUni By (IXTp -M 

£[r(W)| = 0(n" 2 ), 
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NON-NORMAL APPROXIMATION 21 
E\l - (cq/2)E((W - W'f\W)\ = Oin- 1 ' 2 ), 
E\W\ 3 = 0(1) 

Applying Theorem 11.21 gives Theorem 12.11 □ 
We now show that (HDD - (ELD hold. 



Lemma 5.1. With W,W' as in section^ we have 

—3/2 

(5.22) E\E(W -W'\W) W 3 \ < 15n~ 2 , 

3 

(5.23) E\E((W - W') 2 \W) - 2n~ 3/2 | < 15n~ 2 
and 

(5.24) E\W\ 3 < 15. 
Also, obviously, \W - W'\ < 2n~ 3 / 4 . 

Proof. Let m = n~ l Y27=l °* = n ~ l ^W , and for each i, let 

It is easy to see that for r £ {—1, 1} 

(5.25) P(ai = r\a) 



e m,i _|_ g— rrii 



and so 



Hence 



E(a'Aa) = = tanhm,. 

v 41 ' e rrii _|_ e -rrii e mi _|_ g— mj 



n 

E(W-W'\a) = -Yn-^io-t-Eio-'ilo-)) 
i=i 

n 

(5.26) = re~ 3 / 4 m — n~ 7 / 4 tanh rrij . 

i=l 
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Now it is easy to verify that the function 

d 2 — 2sinhx , 

— rtanhx= o — = — 2(tanhx)(l — tanh x) 

dx A cosh x 

has exactly two extrema =Lx* on the real line, where x* solves the equa- 
tion tanh 2 x* = | . It follows that the maximum magnitude of this function 
is 4/3 3/2 . Thus, for all x,y 6 R, 

i i i \ t i \-2i 2(x - y) 2 

| tanh x — tanh y — (x — y) (cosh y) j < 



3 3 /2 



It follows that 

n n 

tanh nij — re tanh m + n~ 1 (coshm)~ 2 Oj 



i=l 

and therefore 



i=l 



< 



2n" 



3 3 / 2 



tanh mi — n tanh m 



i=l 



< Iml + 



2rT 



33/2 



Using this in ()5.26j) and the relation m = n 1 / 4l W, we get 



(5.27) \E(W- W'\a) + n" 3/4 tanh 



m-n 3 / 4 m| < n 2 \W\ + 



2n^ n /4 



33/2 • 

3 

Now consider the function fix) = tanhx — x + Note that f'(x) = 
(coshx) -2 — 1 + x 2 > for all x, and hence / is an increasing function. 
Also /(0) = 0. Therefore /(x) > for all x > 0. Now, it can be easily 
verified that the first four derivatives of / vanish at zero, and for all x > 0, 
d 5 f 16 sinh z x sinrr 4 x . 16 



dx 5 cosh 2 x 
Thus, for all x > 0, 



20 sinh^x 120 £inh^x < 



cosh 4 x 



cosh x cosh x 



< 16. 



,/ \ 16 * 2x 5 
0</(x)<-x 5 = — . 



Since / is an odd function, we get that for all x, 

1 , 1 si 2|x| 5 

tanhx — x H — x < . 

1 3 15 
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Using this information in (|5.27p . we get 

\E(W - W'\a) - -^-m 3 \ < + n~ 2 \W\ + 

Using the relation m = n we get 

n~ 3 ' 2 „^ , 2n~ 2 \W\ 5 , _ 2 . 2n- n / 4 



(5.28) \E{W-W'\a)-—— W 3 \ < J^ + n~ 2 \W\ + 

6 ' 15 6 i l' L 

This implies, in particular, that 

n -3/2 

(5.29) \E((W - W)W 3 ) — E(W 6 )\ 

< 2 "' 2 ^ 8 » + »-^) + 2 "'" / , 4 f |tr|3 . 

15 3 3 / 2 

Thus, 

2„-i/2 evw8\ 

(5.30) £(TU 6 ) < 3n 3 / 2 | J B((^ / -^)^ 3 )| + { ' 

+3n ^ E(w , ) + ^^l, 

Using the crude bound \W\ < n 1//4 , we get 

(5.31) 2n-^E(W*) + 3n - 1/2£;(w4) + 2n-^|3 
< ^^ + 3^) + 2 ^ (w32) 



5 v ; 3V2 

Next, note that by the exchangeability of (W, W), 

E((W'-W)W 3 ) = ^E{(W' -W)(W 3 - W' 3 )) 

= --E((W' -W) 2 {W 2 + WW' + W' 2 )). 

since \W-W'\< 2n" 3 / 4 , this gives 

(5.32) \E((W' - W)W 3 )\ < 6n~ 3/2 E(W 2 ). 

Combining (|53U1) . fOQ , and (J02|), we get 

E(W W 21+ ^W>+ 2£(H/S) 



3V2 y v ' ' 5 ' 
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and therefore, 

5 / 2n _1 \ 
E(W 6 ) < - I 21 + —^-\E{W 2 ) < 36.9245E(W 2 ). 

Since E{W 2 ) < {E(W Q )) 1 / Z , this gives 

(5.33) E(W 6 ) < (36.9245) 3/2 < 224.4. 

and hence (|5.24|) holds. 

Combined with (|5.28p . this gives 

n -3/2 

(5.34) E\E(W -W'\W) W 



31 



3 

„ / 2(224 A) 5 / 6 

< n 



9 /2(224.4) 5 / 6 , s1/fi \ 2n~ n /4 
2 ( ^ + (224.4)V«J + ___ < 15n" 2 . 



By (jSZBD , we have 



1 n 

E((W-W') 2 \a) = -Yin 

71 — ^ 



-3/2 _ 



i=l 



= 2n~ 5/2 ^(1 - tanh(mtCTi)) 

i=l 

n 

= 2n _3/2 - 2n _5/2 ^ <T; tanh m, . 

i=l 

Using | tanhmj — tanhm| < \mi — m\ < n" 1 , we get 

\E((W-W') 2 \a) -2n~ 3 / 2 | 

< 2n~ 5/2 + 2n~ 3/2 m tanh m 

< 2n~ 5 / 2 + 2n~ 3 / 2 m 2 
= 2n" 5/2 + 2n" 2 ^ 2 . 

Using (|5.33p . we get 

E\E{{W - W') 2 \W) - 2n~ 3 / 2 | < 2n~ 5/2 + 2n~ 2 (224.4) 1 / 3 < 15n~ 2 . 
This completes the proof of the lemma. □ 
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5.4. Proof of Theorem 13.11 With p(w) = e w I{ w >o}, f° r given h, let 
fh be the Stein solution given in (|4.4p 



f h (w) = e w (h(t) - Eh(Y))e-*dt = -e w / (h{t) - Eh{Y))e- l dt 



for w > 0. Following the proof of Theorems 11.11 and 11.21 it suffices to show 



By flTTTD, 

\fhH\ < 2\\h\\ min(l - e~ w , e~ w )e w = 2\\h\\ min(l, e w - 1) < 3w\\h\\ 
and by (|OB 

|/fcH| < ||/i / ||e u 'min(-u;e- w + 2(l-e- w ),(u; + l)e- w ) 
< ||/i / ||min(u> + l,2(e t " - 1)) < 3w\\h'\\. 



This proves (I5.35D and hence Theorem 13.11 
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